Guide to Data and Programs for 
Trade Liberalization and Firm Productivity: The Case of India

Authors: Petia Topalova and Amit Khandelwal


There are 2 datasets needed to replicate the results in the manuscripts. Both dataset are in STATA format.

prod_dataregression.dta:  This is the main dataset of the paper. The unit of observation is company / year. It is used to generate Tables 1, 3-9 and Web Appendix Tables 2 and 3.
Note that columns (8) in Tables 4a, 4b and Table 5 cannot be replicated with the data and code provided, as these are estimated in SAS with bootstrapped standard errors. The code for these columns is available from the authors upon request.

industrydata.dta:  This dataset is used to generate Table 2. The unit of observation is traded industry. 

The stata program Tables_restat.do generates all results in the manuscript (except columns (8) in Table 4a, 4b and 5 as mentioned above). Note that the path to the data files in line 20 will need to be changed. 
